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Abstract. Using data on the Berlin public transport network, the pre- 
sent study extends previous observations of fractality within public 
transport routes by showing that also the distribution of inter-station 
distances along routes displays non-trivial power law behaviour. This 
indicates that the routes may in part also be described as Levy-flights. 
The latter property may result from the fact that the routes are planned 
to adapt to fluctuating demand densities throughout the served area. 
We also relate this to optimization properties of Levy flights. 

1 Introduction 

X, 

4h . Transit networks are fascinating objects to study. While each network and every 

element in the network needs to be adapted to a specific situation in the urban 
agglomeration, it turns out that nonetheless general construction principles prevail. 
The latter fact has been evidenced by numerous studies using in particular methods of 
statistical physics, random graph theory, and other approaches. One of the main fields 
contributing to this analysis is complex network science [112] (see [314] for a review). 
Applying the tools of complex network theory it was shown, that transit networks (in 
this paper we consider urban public transportation networks, PTN) share common 
features of other natural and man-made complex networks. In particular, they are 
strongly correlated compact structures (so-called small worlds) [1 , that often possess 
power law scale-free behaviour [5]- It has been shown, that they are resilient against 
random failures while being vulnerable to targeted attacks [416] . The study of fractal 
properties of PTN has a long history |7l8l9ll0fTTj . The prevailing part of these former 
studies concern the density of stations or the total length of track as function of 
the distance from the center of the network. In this way PTNs and rail networks 
of various cities and regions were analyzed. Thibault and Marchand [7] studied the 
fractal properties of different subnetworks within Lyon (regions 1,11, III). The fractal 
dimensions found were for the suburban rail I: 1.64, II: 1.66, III: 1.88; for the public bus 
service: 1.00, 1.09, 1.45; the authors also analyse the drainage utility with results: 1.21, 
1.30, and 1.79. Frankhauser [5] obtained 1.58 for Stuttgart's rail system. Benguigui 
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and Daoud [5] obtained the fractal dimension for the Paris railway transportation 
network as 1.47. In relation to this result they also discuss DLA and star polymers 
as known models that lead to fractal structure. Further results for fractal dimensions 
were given by Benguigui [TU] for several Rhine towns and Moscow railways: 1.70±0.05, 
Paris metro: 1.80 ± 0.05. The fractal dimension of Seul PTN has been estimated as 
1.50 for stations and 1.35 for tracks [TT]. In a complementary approach, the present 
authors measured the mean distance as a function of number of stations traveled [5] . 
For different Berlin subnetworks a fractal dimension in the range of 1.04 - 1.22 was 
found. The Paris metro data leads to a fractal dimension 1.22 when excluding short 
distance contributions. 

Some of the results cited above appear to be compatible with the interpretation 
of the routes as two dimensional self-avoiding walks (SAWs) with a fractal dimension 
of 4/3 [T2]. However, the results appear to vary considerably. Here, in addition to 
the fractal dimension of the transit routes we intend to analyse the distributions of 
inter-station distances of consecutive stations. We will show that these distributions 
appear to have heavy (power-law) tails compatible with a Levy-flight model for the 
transit routes. The latter approach further presents a possible explanation for the 
deviations from the SAW behaviour observed. 

A Levy-flight model is characterised by the distribution function of the step lengths 
I of the otherwise random successive steps. For a specific Levy-flight the complemen- 
tary (the 'tail') of the cumulative distribution function P(£) of the step lengths I 
follows a power law decay 

P(£)~£-». (1) 

where the exponent /i characterises the distribution. Note that that exponents greater 
than 2 belong to the Gaussian Central Limit Theorem (CLT) domain of attraction 
due to the fact that for an exponent \i > 2 the probability density function (PDF) 
has finite variance. Exponents fi < 2, however, belong to the Levy CLT domain of 
attraction. The critical exponent fi — 2 is more delicate and requires the analysis of 
the growth of the associated truncated variance function. 

In the following, we first describe the empirical data as they were drawn from 
the Berlin PTN. Then, we analyze the distribution of the inter-station distances 
along the routes and propose a description of PTN routes and journeys in terms of 
self-avoiding Levy flights. This is supported by the fact that the scaling of distance 
traveled as function of the number of stations deviates from SAWs but is in line with 
Levy flight-like behaviour. 



2 Data analysis 

PTNs are often discussed without reference to their geographical embedding. The 
fact that this subject is left aside by most studies of PTNs with respect to their 
complex network behaviour, is due mainly to the lack of easily accessible data on the 
locations of stations and routes. For the present work we have analyzed such data for 
stations of the Berlin PTN. Our database consists of 2992 stations that belong to 211 
routes. Furthermore, we have taken into account different means of public transit, 
that include subway (U-bahn), tram, bus, and surface high speed transport (S-bahn). 
The data was collected in the frames of a larger project [3 5 including a compara- 
tive analysis of PTN vulnerability of fourteen large cities |4l6j . Here, we complement 
available information about the Berlin PTN topology by the information about the 
geographical location of each single station. To this end, Cartesian coordinates for 
each station were extracted from an on-line map provided by the Berlin public trans- 
port operator. We note that using this procedure slight deviations between distances 
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Fig. 1. Cumulative distribution for Berlin PTN: separate distributions for each means of 
transportation (a) and the sum of these distributions (b). The fit of the log- log plot in (b) 
corresponds to an exponent value of /i = 2 ± 0.05. 



calculated on the spherical surface and those derived from the projection to a flat 
surface with Cartesian coordinates occur. These will be neglected below. With the 
data on the topological and geographical properties of the PTN at hand we are in 
the position to analyze how these properties are related. 

A short glance at the map of a public transport network immediately reveals that 
transport lines do in general not follow the shortest "straight ahead" route between 
their two end points. Thus the question arises if there is any other underlying struc- 
ture or principle characterizing the observed behaviour of geographically embedded 
transport routes. Let us analyse transport routes with respect to their fractal proper- 
ties in terms of random walks, self-avoiding walks and Levy flights. In former studies 
[5] we have analyzed the dependence of the mean square distance between stations 
(R 2 ) as function of the number N of stations traveled. The following power law scaling 
was found: 

(i? 2 ) ~ N 2v (2) 

with the exponent v ranging from v = 0.82 for bus routes to v — 0.9 and 0.96 for 
subway and tram routes. The S-bahn data is distorted due to a ring structure within 
this sub-network. The obtained values of the exponent lie in the region 3/4 < v < 1. 
These results are to be compared with the behaviour of self-avoiding walks (SAW) in 
two dimensions characterised by an exponent v = 3/4 = 0.75 which corresponds to 
the exact result for the fractal dimension of D$aw = 4/3 [T2]. This observation leads 
to the hypothesis that the PTN routes may follow SAW statistics. 

To further analyze the situation, let us explore in addition the statistics of the 
inter-station distances. In Fig. [1] we plot the cumulative distribution P(£) of the 
distances I between consecutive stations along the routes showing individual curves 
for each mode of transport as well as an integral curve including all modes (Fig. 
[T^.). As follows from the linear regions of the corresponding log- log plots, the region 
of power law behaviour does not extend much more than one order of magnitude 
for each single mode of transport. In the combined curve, this power law regime is 
expanded. The slopes and thus the corresponding exponents are very similar for the 
bus and S-bahn routes. As one can deduce from the plot, the contribution of tram 
and U-bahn to the combined curve is rather weak. In Fig. Q]b we show the fit of the 
combined curve to the power law 

P{i) ~ r" (3) 

with an appropriate exponent value fi = 2 ± 0.05. Our finding is that at least in 
part the distribution P(£) displays scale-free properties. In the following, we want to 
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connect these scale-free properties observed for large scales with the behaviour of the 
distribution for smaller distances I. To this end we project the distance vectors £ to 
the x and y axes and analyze the probability density functions (PDF) p x (x), p y {y). 
Under the hypothesis that the tails of the distributions are governed by a power law 
we may expect that the distributions p x {%), Py{y) m &y be described by a Levy te- 
stable distribution. To fix notation we here specify the characteristic function of the 
a-stable Levy distribution 

r / Tra\ 

f(t; a, f3, r, c) = exp y — c a \t\ a yl — A/3sign(£) tan — J + i-rt)J , (a ^ 1), (4) 

with shape, skewness, location, and scale parameters a, /3, r, and c. The asymptotic 
behaviour of the Levy a-stable distribution function F(x; a, j3, r, c) for a < 2 is then 
given by [13]: 

P{x) ~ c a d(l + (3)x~ a , with d = sin(wa/2)r(a)/iT. (5) 

This relates the exponents, a = //. Let us emphasize that the above fit concerns 
the entire distribution function, and therefore it takes into account the behaviour at 
smaller values of I. This is complementary to fitting P(£), where only the asymptotic 
(complementary) behaviour is taken into account. 

In Fig. [5] we fit the empirically determined PDFs p x (x), p y {y) by an a-stable 
distribution [T3]. The best fits are obtained for values a = 1.71 for the ^-projection 
and a = 1.67 for the y-projection. These values are to be compared with the value 
of fi derived above. Recall that for a perfect a-stable distribution the relation a = /i 
should hold. The latter holds in our case only approximately. While one may not 
expect high precision from the given data, the obtained values of a and //. support 
the conjecture that the inter-station distances are governed by a non-trivial a-stable 
distribution. 

In further support of the fit to a Levy-stable distribution as described above, we 
show in Fig. [3] the so-called percentile plots for the cumulative probability density 
functions (CDF) defined as: 

P(x) = [ p{x')dx'. (6) 

J — oo 

In the light of these results, let us now re-examine the data on the end-to-end 
mean square inter-station distance behaviour, cf. Eq. ([2]). As mentioned above, the 
power law scaling with the exponent v has originally been interpreted in terms of 
SAW behaviour in 2d. Note as well, that the most reliable fit was achieved for the 
bus subnetwork which also provides the largest data set and resulted in an exponent 
v = 0.82. Let us re-examine this value assuming as a working hypothesis a Levy 
flight like behaviour for the inter-station distances. Analytical and numerical results 
available for self-avoiding Levy flights show that their scaling exponents differ from 
the usual SAW exponents |TS]- In particular, for the node-avoiding Levy flight the 
exponent v is known in the form of an e = (2fi — (i)-expansion |16I17] : 

2 



Flory-type arguments on the other hand lead to |15j : 

_ / 3/(/i + d), d<d c = 2fx, 
V - \ 1/fi, d > d c . W 

Substituting the values fi = 1.71, \i = 1.67 into Eq. we get v — 0.80, v — 0.81, 
whereas following Eq. © we arrive at v = 0.81, v = 0.82, both compatible with the 
observed behaviour of the bus-subnetwork. 





c. d. 

Fig. 2. Linear plots of the probability density functions (PDF) for all means of transport 
projected to the x- and y- axes (left), lin-log plots of the same (right) fitted to the a-stable 
distribution with a = 1.71 for the a>plots and a = 1.67 for the y-plots. 



3 Conclusions and outlook 



The present study extends previous observations of fractality within public transport 
routes [5171819110111] by showing that also the distribution of inter-station distances 
along routes displays non-trivial power law behaviour. This indicates that the routes 
may in part also be described as Levy-flights. The latter property may result from the 
fact that the routes are planned to adapt to fluctuating demand densities throughout 
the served area but also possibly related to optimisation properties [18] . 

One may object, that the distribution derived above mixes different means of 
transport with different mean inter-station distances. However, rather observing jour- 
neys of individual passengers one may acknowledge that they will in general use two or 
more means of transport to complete their journeys. The given combined distribution 
of inter-station distances allows them to perform this journey such that it resembles 
a Levy flight. This may allow for travel optimization. 

Self-avoiding Levy flights, apart from observing the constraint of non self-intersec- 
tion, evolve randomly. The fact that PT routes at least within the present sample 
appear to display the same scaling symmetry is quite unexpected. In particular, this 
behavior seems to be at odds with the requirement of minimizing passengers traveling 
time between origin and destination. The latter argument, however, ignores the time 
passengers spend walking to the initial and from the final stations. Including these, 
one understands the need for the routes to cover larger areas by meandering through 
neighbourhoods. Given the requirements for a PTN to cover a metropolitan area with 
a limited number of routes while simultaneously offering fast transport across the city 
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c. d. 

Fig. 3. Percentile plots of the cumulative a-stable probability density distribution function 
(CDF) in comparison to the data. Distributions are shown in linear scale for the step-length 
projected to the :r-axis (Figs 3a,b) with exponent a — 1.71 and to the y-axis (Figs 3c, d) 
with exponent a — 1.67. Figs. 3(a,c) display the behaviour of the 25%-100% percentile while 
Figs. 3(b,d) display the 0%-75% percentile. 

one may speculate that routes scaling like self-avoiding Levy nights may present an 
optimal solution |4I19] . 
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